On the Challenging Instances of the Planted Motif Problem
نویسندگان
چکیده
A classic problem of motif discovery in DNA sequences, called the Planted (l, d)-Motif Problem has been widely studied over the past decade owing to its application in identifying vital signals such as transcription factor binding sites. Challenging instances of the problem are those that have been probabilistically proved as ‘difficult to be solved’ due to the existence of several motifs by random chance for such instances. In this work, we present an expected case analysis that helps understanding the real ‘challenging instances’ of the problem.
منابع مشابه
Generalized Planted (l, d)-Motif Problem with Negative Set
Finding similar patterns (motifs) in a set of sequences is an important problem in Computational Molecular Biology. Pevzner and Sze [18] defined the planted (l,d)-motif problem as trying to find a lengthl pattern that occurs in each input sequence with at most d substitutions. When d is large, this problem is difficult to solve because the input sequences do not contain enough information on th...
متن کاملqPMS7: A Fast Algorithm for Finding (ℓ, d)-Motifs in DNA and Protein Sequences
Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motifs discovery is an important problem in biology. For example, it is ...
متن کاملExact Algorithms for Planted Motif Problems CONTACT AUTHOR:
The problem of identifying meaningful patterns (i.e., motifs) from biological data has been studied extensively due to its paramount importance. Three versions of this problem have been identified in the literature. One of these three problems is the planted (l, d)-motif problem. Several instances of this problem have been posed as a challenge. Numerous algorithms have been proposed in the lite...
متن کاملSpace and Time Efficient Algorithms for Planted Motif Search
We consider the (l, d) Planted Motif Search Problem, a problem that arises from the need to find transcription factor-binding sites in genomic information. We propose the algorithms PMSi and PMSP which are based on ideas considered in PMS1 [6]. These algorithms are exact, make use of less space than the known exact algorithms such as PMS and are able to tackle instances with large values of d. ...
متن کاملGamot: an Efficient Genetic Algorithm for Finding Challenging Motifs in Dna Sequences
Weak signals that mark transcription factor binding sites involved in gene regulation are considered to be challenging motifs. Identifying these motifs in unaligned DNA sequences is a computationally hard problem which requires efficient algorithms. Genetic Algorithms (GA), inspired from evolution in nature, are a class of stochastic search algorithms which have been applied successfully to man...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006